Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/18664/16

Small-parallel Exemplar-based Voice Conversion in Noisy Environments Using Affine Non-negative\nMatrix Factorization

01-Jan-1970 Research 2016 : April - June

Ryo Aihara, Takao Fujii, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki

The need to have a large amount of parallel data is a large hurdle for the practical use of voice conversion (VC). This paper presents a novel framework of exemplar-based VC that only requires a small number of parallel exemplars. In our previous work, a VC technique using non-negative matrix factorization (NMF) for noisy environments was proposed. This method requires parallel exemplars (which consist of the source exemplars and target exemplars that have the same texts uttered by the source and target speakers) for dictionary construction. In the framework of conventional Gaussian mixture model (GMM)-based VC, some approaches that do not need parallel exemplars have been proposed. However, in the framework of exemplar-based VC for noisy environments, such a method has never been proposed. In this paper, an adaptation matrix in an NMF framework is introduced to adapt the source dictionary to the target dictionary. This adaptation matrix is estimated using only a small parallel speech corpus. We refer to this method as affine NMF, and the effectiveness of this method has been confirmed by comparing its effectiveness with that of a conventional NMF-based method and a GMM-based method in noisy environments.

How to Cite this Article
CC Compliant Citation: Aihara, Ryo, et al. \"Small-parallel exemplar-based voice conversion in noisy environments using affine nonnegative\nmatrix factorization.\" EURASIP Journal on Audio, Speech, and Music Processing 2015.1 (2015): 1-9, DOI 10.1186/\ns13636-015-0075-4, http://creativecommons.org/licenses/by/4.0/.
Download Full Text

Call Us: +4 (800) 888-0008

Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/18664/16

Small-parallel Exemplar-based Voice Conversion in Noisy Environments Using Affine Non-negative\nMatrix Factorization

How to Cite this Article

Links

Contact Us